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ON A THEOREM IN PROBABILITY. 



BY B. L. DE POEEST, WATEBTOWN, CONN. 

It should be noticed in connection with the first property discussed in 
my paper on polynomials (Analyst, March, 1880), that component ad- 
justment formulas, such as (4) and (11) for example, need not satisfy the 
condition mentioned at page 41, of having the sum of the coefficients I or 1j 
equal to unity, unless the formulas are to be used for adjustments or other 
special purposes. The I and the L may have any values whatever, and the 
application of the two components will still have the same effect as that of 
one resultant formula with coefficients which are those of the product of the 
two polynomials corresponding to the components. So too, if any number 
of components are successively used, and in any order, they are together 
equivalent to a single resultant whose coefficients are those of the continued 
product of the corresponding polynomials. 

The whole property seems to be related to a theorem in probability which 
was first given in its most general form by Lagrange. (See the complete 
edition of his works, Paris, 1868, Vol. II, p. 198.) With some changes of 
notation and method, it may be stated thus. Suppose that observations of 
any class are subject to given errors, or deviations from the true value of 
the observed quantity, and that each of these errors may occur in a given 
number of ways or cases, and that all the ways are equally probable. The 
errors a, b, c, &c., may be essentially either positive or negative, and they 
can occur in a, ^, y, &c., ways respectively. Then in k observations, the 
number of ways or cases in which the algebraic sum of the errors can be v, 
is the coefficient of a;" in the expansion of the polynomial 

{ax" + j8a^ + r^' + &c.)*, (1 ) 

The probability that the sum will be v is therefore the coefficient of x° in 
the expansion of 
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(ax" + fa ^ + 7-g!°+ &c. Y ,„. 

\ a + ^ + y + &(,.) ' ^ ' 
which may be written 

{px" + qx" + rx" + &c.)*, (3) 

where p, q, r, &c., are the probabilities of the occurrence of the errors a, b, 
c, &c,, respectively, in a single observation, and 

p + q + r -{-&(i. = 1. (4) 

The same coefficient of x" in the expansion of (3) is also the probability 
that the error of the mean of the k observations is v-i-k. 

The most probable total effect of the error a, in k observations, is kpa, 
that of b is kqb, that of c krc, and so on, wherefore the most probable alge- 
braic sum of the errors is 

k(pa -{- qb -\- re + &c.). (5) 

and this is approximately the exponent of that power of x whose coefficient 
in the expansion of (3) is a maximum. The most probable error in the 
mean of the k observations is therefore 

pa -\- qb -\- re + &c. (6) 

The above is Lagrange's theorem in substance. An abstract of it may be 
found in Todhunter's History of the Theory of Probability, p. 304. We now 
observe further, that the exponents a, b, c, &c., which represent the errors, 
can be regarded as whole numbers, for if they were fractions, they might be 
reduced to whole numbers by multiplying them all by a common multiple 
of their denominators, which amounts simply to measuring the errors on a 
reduced scale. And the exponents may all be made positive, for if the larg- 
est negative one is — h for example, we can add h to each of them, and so 
convert (3) into the form 

(Po + Pi^ + P22' + • • • + P.X)*, (7) 

where s° and 2"+* take the places of x~^ and a;" respectively. Each term in 
the first power of (3) is now represented by some term in the first power of 
(7). Those terms in the first power of (7) which do not represent any term 
in the first power of (3), will of course have their coefficients p equal to 
zero, so that 

i'o + i>i + P2 + • • • + i^m = 1. (8) 

The coefficients in the expansion of (3) to the k power are not altered by 
multiplying all the exponents of the first power of the polynomial by a con- 
stant immber, or by adding a constant number to all of them, or by both 
these changes together. Hence it follows that the properties of the lever 
arm and the radius of gyration of the coefficients in powers and products of 
polynomials, demonstrated by me in the Analyst, March and May, 1880, 
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pp. 44 and 74, will hold good alike whether the exponents are positive or 
negative, integral or fractional, provided that we locate the coeificients along 
the axis or lever at distances from a given point which are equal or propor- 
tional to the corresponding exponents. The lever arm of the coefficients in 
the product of two or more polynomials, about the place of exponent zero, 
is equal to the sum of the lever arms of all the factors, about their respect- 
ive zero places. We represent by Jx that interval on the lever arm which 
corresponds to a diiference of unity in the exponents of (7). 

It may be well to prove the above directly for the two polynomials 

^_„2-"« + .. . +;8_2^-^+;8-ia-^+;8o«''+^i«''+/?22'^+ ••• +i5«a''", J ^ ^ 
which have both positive and negative exponents. Let the sum of the 
coefficients a in the first polynomial be /S^, and let their lever arm about 
the place of «o ^^ a fulcrum be h^. Also let the sum of the coefficients /9 
in the second polynomial be 8 2, and let their lever arm about the place of 
^0 be h^. The statical moments of the two systems about these fulcrums 
are 

'S'1^1 = (-a_^a„-«_^+ia^_i- . . . -a_iai +a.^b^ -f-a262+ • • • +«nfim)^x;, ) 
'S'2^2 = (-i3-« <'„-i9_,.+ iC„_i-...-/9_iCi-f^idi+/92Ci!2+. . . +^nd„)Jx. j 

(10) 

Now when the first one of the polynomials (9) is multiplied by the second, 
the moments about the place of olq^q, of the coefficients in the parts of tlie 
product due to j8q, fi^, fi^, . . . /3„, are 

^oSiK, ^^S^ihi+diJx), j328i{h^+d2Jx),...^„8^{hi+d„Jx), 
and those for the parts due to /9_i, /5_2, . . . /9_„ are 

^_i8i{hi—eiJx), /3_2/S'i(A.i— 02^a;), . . . i9_„^i(Ai— o„ Jx). 
The sum of all the coefficients in the product is 8^82, and denoting by H 
their lever arm about the place of ao/^o j '^^ have, by addition of the forego- 
ing moments, 

8i82H= ^i/ii(i3_„-f/9_„+i+ . . . +^-i+/9o+/5i+;82+ • • • +^n) 

+ 'Si(-/9_„c„-/9_„+iO„_i- . . -^-iOi+i9idi-l-^2'^2+ • • +Mn)^x, 
which by the help of (10) is reduced to 

S^S^H = /Si^2^h + '^1-82^2, 

.•.H=h^+h,, (11) 

so that the lever arm in the product is equal to the sum of the lever arms 
in the two factors. 

We will now make another statement and application of Lagrange's the- 
orem. An urn contains a number of balls, some of which are marked 0, 
others 1, others 2, &c., in such proportion that the probabilities of drawing 
a single ball marked 0, 1, 2, .... m are p^, p^, p^, • • . . p,„, respectively. 
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The occurrence of an error 6, under the first statement, is here represented 
by the drawing of a ball marked b-\-h. If each ball is returned as soon as 
drawn, and they are all mixed so as to keep the chances the same, then 
after k drawings, the probability that the sum of the numbers drawn shall 
be a given number s, is the coefficient of z" in the expansion of (7). Proof 
of this will be given farther on. The most probable shares which the num- 
bers 0, 1, 2, &c., will respectively have in the formation of the sum, are 

hp^XO, Jcp-^Xl, hp^X^, &c., kp^Xm, 
so that the most probable sum will be 

KPi + 2p2 + 3p3 + . . . + mp^). (12) 

This is the value, approximately, of the exponent of z in that term of the 
expansion whose coefficient is a maximum. It is in general approximate 
only, because it is not usually a whole number, as the exponents here are 
supposed to be. It will be seen that the portion of (12) within the paren- 
theses is the lever arm of the coefficients p in the first power of (7), about 
the place of p^ as a fulcrum, the constant interval Jx between the coeffi- 
cients being unity. This answers to the property noticed by Lagrange in 
his first corollary. I have shown further, that the lever arm of the coeffi- 
cients in the expansion to the k power will be k times what it is in the first 
power. Hence according to (12), the maximum coefficient in the expansion 
will be located approximately at the centre of forces, or centre of gravity, of 
the whole expanded series of coefficients, even when k is not a large num- 
ber. I have before shown that this is true at the limit, when k becomes 
large or infinite, and proved that the series of coefficients then takes the 
form of the probability curve. (Analyst, Sept. and Nov., 1879.) For a 
given value of k, the place or rank of the vertex is known and the curve 
can be constructed. Since k in my article last mentioned is in strictness in- 
finite, we may write k instead of k -{- 1. Also b^{dxf there represents the 
square of the radius of gyration r, of the coefficients of the first power of the 
polynomial, about their centre of forces. The equation of the curve then is 

^' = 2-¥^' 2/ = ^e— , (13) 

where dx is what the interval Ax reduces to when the coefficients, regarded 
as ordinates, are brought together so as to be consecutive, and kr^ is the 
square of the radius of gyration of the coefficients in the expanded poly- 
nomial, about their centre of forces. This radius is here seen to be identical 
with the quadratic mean error; as we have already noticed in the case of 
the expansion of the binomial (p+g)"*. (Analyst, Jan., 1880, pp. 8 and 
22.) The probable error, or probable deviation of the sum of the numbers 
drawn, from its most probable value, is 
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.6745(^),/L (14) 

III a special case, where the coefficients p are all equal, Laplace found, by a 
very different method, that the limiting form of the expansion is what we 
call the probability curve. {The'orie analytique des Probabilite's, edition of 
1847, pp. 161 and 333.) He took this as a basis for demonstrating the ex- 
ponential law of probability, and extended it to the case of given errors 
distributed according to any algebraical law, provided it be such that pos- 
itive and negative errors of equal amount are equally probable. An account 
of the method may be found in Airy's Theory of Errors of Observations, pp. 
7 to 15. De Moivre had first shown that the probability curve is the lim- 
iting form of the expansion of (IH- J)'". (Todhunter, p. 192.) The special 
case above named where the coeff. p are all equal, arises when when we make 
a number of throws with dice, and wish to find the probability the sum of 
the numbers thrown will be a given number, or, fall within given limits. 

The general theorem of Lagrange can be extended to include the case of 
n urns, which we will suppose to contain balls bearing either positive or 
negative numbers, in such proportions that the probabilities of drawing the 
several numbers Irom — m to -\-m are p'_m, • • • p'o) P'd V'ly • • • -P'm for 
the first urn, p'l^, . . . p^, p", p'^, . . . pZ for the second, etc., and p^'*,, . . , 
p'o\ P^r^ P^^ • • • Pm^ fo"" tJi6 '^th urn, m being the highest number, positive 
or negative, on any ball. Then if we draw one ball from each urn, the 
probability that the sum of all the numbers drawn shall be s, will be the 
coeiificient ofz' in the continued product of the polynomials 

pL^2-"+ . . . +p',+p\z +pW+ • • • +PL2'", 1 

p'l^z-'-'-^ . . . +p'i+p'iz +py+ . . . +p':z'^, ! ^jg^ 

pLi2-"*+ . . . +pi>+pfz+p<^v+'. .'.'-f-pw^'".* J 

For this coefficient of z^ is the sum of all the products of n factors p taken 
in the order p'p"p"' , . . p("), which can be formed in such manner that the 
sum of all the sub-indices in each product shall be s; and each of these pro- 
ducts is the probability of drawing, in that particular order, the numbers 
denoted by the sub-indices. If we draw k balls from a single urn, replac- 
ing each as before, we must merely suppose that Jc polynomials have like 
coefficients. Hence in general, if we make k' drawings from the first urn, 
k" from the second, and so on to the nth, the probability that the sum of 
all the numbers drawn shall be s, will be the coefficient of z' in the contin- 
ued product of the expansions of the polynomials 

(pL„a-»+ . . . +p'o +p\ z+p'^ z'+... +pizy ; 



(p<:is-'"+ p(^>+p«3-i-p§v-h . . . -t-p<:>2»')*(">. 



(16) 
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When the exponents h are tolerably large, there is no need to actually per- 
form the multiplications. My theorems respecting the lever arm and the 
radius of gyration of the coefficients enable us easily to find the centre of 
forces for the final product, giving the most probable value of s, and to find 
the radius of gyration about this centre, determining h in the probability 
curve (13). The proposition that the most probable value is at the centre 
of forces approximately, is also in accordance with the known theorem in 
mechanics, that the radius of gyration (here the mean error) is a minimum 
when the axis of rotation is taken through the centre of gravity. 

To illustrate the foregoing, suppose that two urns contain ten balls each. 
The first has four balls marked — 1, five marked 0, and one marked 2; 
while the second has one marked — 2, two marked — 1, four marked 1, and 
three marked 2. Twelve drawings are made from the first and eight from 
the second. The two polynomials are 

/ 42-i-|-5+ 2'\" /'2-2-|-2a-i+40-f322\8 

V 10 }' \ 10 j' ^> 

and the lever arms of the coefficients in their first powers, about the places 
of 2° as fulcrum, are 

^[4(_l)-|-5(0)+l(2)]i.,= -iJa., -I , 

^V[l(-2)+2(-l)+4(l)-t-3(2)] Jaj = f ia;. / ^ ^^ 

Hence, in the product of the expansions of (17), the lever arm, about the 
place of 2" as a fulcrum, is 

12(— ^Ja;)-}-8(|Ja!) = J^Ja;. (19) 

The integer nearest to ^ is 2, which is therefore the most probable alge- 
braic sum of the numbers drawn. The squares of the radii of gyration of 
the coefficients in the first powers of the polynomials (17), about the places 
of the extremity of the lever arm, or the centre of parallel forces, are 

TV[l(¥)^+2(|)^+4(|)^+3(i)^] {Axf ^ ^{Axf. f ^^> 

Therefore in the product of the expansions of (17), the square of the radius 
of gyration of the coefficients, about their centre of forces, is 

imm+^iw] i^^y = ■2SM{jxf. (21) 

The probable error then is 

.6745i/(23.84) = 3.29, 
so that it is about an even chance that the sum of the numbers drawn will 
fall within the limits i^ ± 3.29. 

The product of the expansions of (17) will approximate to the form of a 
probability curve. I have already shown that this is true for any high pow- 
er of a single polynomial whose coefficients are all positive, so that the two 
expansions in (17) will each be of that form. Now in the curve (13) the 
parameter h is constant so long as kr^ is constant, though h and r^ should 
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vary inversely as each other; and many different polynomials may give the 
same value of r*. Hence a curve of given parameter may be regarded as 
approximately representing the form of the expansion, to some power, of an 
indefinite number of different polynomials. The two expansions in (17) 
may thus be taken as expansions of some one undetermined polynomial, to 
different powers. Their product will thus be an expansion of that poly- 
nomial, and consequently will take the form of the probability curve. 

To construct its equation from (13), putting da; for Jx in the radius of 
gyration, we have 



h' = 



hdx == .14482, 



2x23.84(da;)'" 
and writing i = x-^dx, we get 

log y = log (Mb) — Jlog tt — i*(Ma;)'log e 
= 2.91226— .0091085*^. 

The vertex of the curve, where z = 0, is at the centre of forces In the 
product of the expansions of (17), and the rank of that centre from 2", as 
given in (19), is ^. Hence the place of the exponent 2 corresponds to 
i = — |-, so that if we assign to i the values — ^, — |-, f, f, &c., the re- 
sulting values of y will be the approximate probabilities that the sum of the 
numbers drawn will be 1, 2, 3, 4, &c., respectively. The larger the expo- 
nents h are in such cases, the more accurate will the results be. 

To show the general correctness of the above approximation, the compu- 
ted values of y, for each value of i, are shown in the subjoined table. In 
the column headed C are also placed the values of the coefficients in the 
product of the expansions (17), as found by actual multiplication. 
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that the series C is not perfectly smooth and regular, but its general agree- 
ment with the series y is very close. 

The probabilities that an event will happen or fail certain numbers of 
times in m trials, are represented, as is well known, by terms in the expan- 
sion of {^+qT. (Analyst, May 1879, p. 65.) This is the simplest case 
under our present theorem, where there is but a single urn, containing balls 
marked either or 1 for example, so that (16) reduces to 
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{v'o+V'i^T- (23) 

The coefficient of 2' in the expansion is the probability that the event 1 will 
happen s times, and consequently that the event will happen ¥ — s times. 

Reverting now to our first statement of Lagrange's theorem, as applied 
to the probable effect of a system of given errors a, b, c, &c., it appears that 
the most probable error in the mean result of h observations is the /cth part 
of that total error, or algebraic sum of the errors, whose probability occu- 
pies the centre of forces in the series of coefficients in the expansion of the 
polynomial (3), when the coefficients in this expansion are regarded as masses 
acting about the point of no error as a fulcrum, with lever arms equal to the 
total errors whose probabilities are represented by the coefficients. The 
probable sum of the total deviations from the centre of forces or centre of 
gravity on one side, is equal to the probable sum of those on the other, so 
that the most probable mean result of the Ic observations is the arithmetical 
mean of all the mean results possible under the given system of errors, each 
possible result having a weight proportional to the prob'y of its occurrence. 

If the algebraic sum of the given elementary errors, each weighted in pro- 
portion to the probability of its occurrence, is zero, the most probable mean 
result will be the true value of the observed quantity ; and not otherwise. 
The most probable result will be greater or less than the true value accord- 
ing as the + or — elementary errors preponderate. When these errors are 
unknown, we have no reason to expect a preponderance on one side rather 
than on the other, and hence we conclude that as far as the true value of art 
observed quantity can be inferred from the results of numerous observations, 
the arithmetical mean of all the observed values is the most likely to be the 
true value. It is to be presumed that the observed deviations from the 
mean will be so distributed as to represent approximately the distribution 
of all the possible deviations. 

As the number of observations is increased, the series of coeff's in the ex- 
pansion of (3) approaches rapidly, as I have shown, to the form of the prob- 
ability curve (13), and attains this form at the limit, when k becomes very 
large, or infinite. It appears to me that we have here the most general and 
complete proof of the validity of that exponential law of facility of error, or 
probability of deviation from the most probable value, whose geometrical 
expression is called the probability curve. The elementary errors are not 
restricted here by assuming that they follow an algebraical law of distribu- 
tion, nor are positive and negative errors of equal amount required to be 
equally probable. 

I will take a future occasion to show the application of similar principles 
in determining the law of errors in two dimensions, or errors in the position 
of points in a plane. 



